智能论文笔记

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

Artificial Intelligence Security Competition (AISC)

Yinpeng Dong , Peng Chen , Senyou Deng , Lianji L , Yi Sun , Hanyu Zhao , Jiaxing Li , Yunteng Tan , Xinyu Liu , Yangyi Dong

分类：人工智能 | 计算机视觉 | 机器学习

2022-12-07

The security of artificial intelligence (AI) is an important research area towards safe, reliable, and trustworthy AI systems. To accelerate the research on AI security, the Artificial Intelligence Security Competition (AISC) was organized by the Zhongguancun Laboratory, China Industrial Control Systems Cyber Emergency Response Team, Institute for Artificial Intelligence, Tsinghua University, and RealAI as part of the Zhongguancun International Frontier Technology Innovation Competition (https://www.zgc-aisc.com/en). The competition consists of three tracks, including Deepfake Security Competition, Autonomous Driving Security Competition, and Face Recognition Security Competition. This report will introduce the competition rules of these three tracks and the solutions of top-ranking teams in each track.

translated by 谷歌翻译

Heterogeneous Randomized Response for Differential Privacy in Graph Neural Networks

Khang Tran , Phung Lai , NhatHai Phan , Issa Khalil , Yao Ma , Abdallah Khreishah , My Thai , Xintao Wu

分类：机器学习

2022-11-10

Graph neural networks (GNNs) are susceptible to privacy inference attacks (PIAs), given their ability to learn joint representation from features and edges among nodes in graph data. To prevent privacy leakages in GNNs, we propose a novel heterogeneous randomized response (HeteroRR) mechanism to protect nodes' features and edges against PIAs under differential privacy (DP) guarantees without an undue cost of data and model utility in training GNNs. Our idea is to balance the importance and sensitivity of nodes' features and edges in redistributing the privacy budgets since some features and edges are more sensitive or important to the model utility than others. As a result, we derive significantly better randomization probabilities and tighter error bounds at both levels of nodes' features and edges departing from existing approaches, thus enabling us to maintain high data utility for training GNNs. An extensive theoretical and empirical analysis using benchmark datasets shows that HeteroRR significantly outperforms various baselines in terms of model utility under rigorous privacy protection for both nodes' features and edges. That enables us to defend PIAs in DP-preserving GNNs effectively.

translated by 谷歌翻译

Attention Spiking Neural Networks

Man Yao , Guangshe Zhao , Hengyu Zhang , Yifan Hu , Lei Deng , Yonghong Tian , Bo Xu , Guoqi Li

分类：计算机视觉

2022-09-28

从大脑的事件驱动和稀疏的尖峰特征中受益，尖峰神经网络（SNN）已成为人工神经网络（ANN）的一种节能替代品。但是，SNNS和ANN之间的性能差距很长一段时间以来一直在延伸SNNS。为了利用SNN的全部潜力，我们研究了SNN中注意机制的影响。我们首先使用插件套件提出了我们的注意力，称为多维关注（MA）。然后，提出了一种新的注意力SNN体系结构，并提出了端到端训练，称为“ ma-snn”，该体系结构分别或同时或同时延伸了沿时间，通道以及空间维度的注意力重量。基于现有的神经科学理论，我们利用注意力重量来优化膜电位，进而以数据依赖性方式调节尖峰响应。 MA以可忽略的其他参数为代价，促进了香草SNN，以实现更稀疏的尖峰活动，更好的性能和能源效率。实验是在基于事件的DVS128手势/步态动作识别和Imagenet-1K图像分类中进行的。在手势/步态上，尖峰计数减少了84.9％/81.6％，任务准确性和能源效率提高了5.9％/4.7％和3.4 $ \ times $/3.2 $ \ times $。在ImagEnet-1K上，我们在单个/4步res-SNN-104上获得了75.92％和77.08％的TOP-1精度，这是SNN的最新结果。据我们所知，这是SNN社区与大规模数据集中的ANN相比，SNN社区取得了可比甚至更好的性能。我们的工作阐明了SNN作为支持SNN的各种应用程序的一般骨干的潜力，在有效性和效率之间取得了巨大平衡。

translated by 谷歌翻译

Learning Representations for Hyper-Relational Knowledge Graphs

Harry Shomer , Wei Jin , Juanhui Li , Yao Ma , Jiliang Tang

分类：机器学习 | 人工智能

2022-08-30

知识图（kgs）因其学习单一关系事实的表示能力而获得了突出。最近，研究重点是建模超级关系的事实，这些事实超出了单一关系事实的限制，使我们能够代表更复杂和现实的信息。但是，现有的超级关系中学习表征的方法主要集中于增强从预选赛到基础三元组的沟通，同时忽略了从基本三重限制者到资格赛的信息流。这可能会导致次级预选赛表示，尤其是在提出大量预选赛时。它促使我们设计一个利用多个聚合器来学习超级关系事实的表示框架：从基本三重的角度来看，一个框架从资格符的角度来看。实验证明了我们框架对多个数据集的超相关知识图完成的有效性。此外，我们进行了一项消融研究，以验证各个组件在我们的框架中的重要性。可以在\ url {https://github.com/harryshomer/quad}找到复制我们的结果的代码。

translated by 谷歌翻译

HTML版本

Enhancing Graph Contrastive Learning with Node Similarity

Hongliang Chi , Yao Ma

分类：机器学习

2022-08-13

图神经网络（GNN）在学习图表表示方面取得了巨大成功，从而促进了各种与图形相关的任务。但是，大多数GNN方法都采用监督的学习设置，由于难以获得标记的数据，因此在现实世界中并不总是可行的。因此，图表自学学习一直在吸引越来越多的关注。图对比度学习（GCL）是自我监督学习的代表性框架。通常，GCL通过将语义上相似的节点（阳性样品）和不同的节点（阴性样品）与锚节点进行对比来学习节点表示。没有访问标签，通常通过数据增强产生阳性样品，而负样品是从整个图中均匀采样的，这导致了亚最佳目标。具体而言，数据增强自然限制了该过程中涉及的正样本的数量（通常只采用一个阳性样本）。另一方面，随机采样过程不可避免地选择假阴性样品（样品与锚共享相同的语义）。这些问题限制了GCL的学习能力。在这项工作中，我们提出了一个增强的目标，以解决上述问题。我们首先引入了一个不可能实现的理想目标，该目标包含所有正样本，没有假阴性样本。然后，基于对阳性和负样品进行采样的分布，将这个理想的目标转化为概率形式。然后，我们以节点相似性对这些分布进行建模，并得出增强的目标。各种数据集上的全面实验证明了在不同设置下提出的增强目标的有效性。

translated by 谷歌翻译

ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding

Bingning Wang , Feiyang Lv , Ting Yao , Yiming Yuan , Jin Ma , Yu Luo , Haijin Liang

分类：自然语言处理 | 计算机视觉

2022-08-05

视觉问题回答是自然语言和愿景理解的重要任务。但是，在大多数公众视觉问题上回答了诸如VQA，CLEVR之类的数据集，这些问题是针对给定图像的特定于“她的眼睛是什么颜色？”的人类产生的。人类产生的众包问题相对简单，有时对某些实体或属性有偏见。在本文中，我们介绍了一个基于Image-Chiqa的新问题回答数据集。它包含Internet用户发布的现实查询，并结合了几个相关的开放域图像。系统应确定图像是否可以回答问题。与以前的VQA数据集不同，这些问题是现实世界中独立的查询，这些查询更加各种和无偏见。与先前的图像回程或图像捕获数据集相比，Chiqa不仅衡量了相关性，而且还可以衡量答案性，这需要更细粒度的视力和语言推理。 Chiqa包含超过40k的问题和超过200k的问题图像对。将三级2/1/0标签分配给每个对，指示完美的答案，部分答案和无关紧要。数据分析表明，Chiqa需要对语言和视觉有深入的了解，包括接地，比较和阅读。我们评估了几种最先进的视觉语言模型，例如ALBEF，表明仍然有一个很大的改进奇卡的空间。

translated by 谷歌翻译

BiFeat: Supercharge GNN Training via Graph Feature Quantization

Yuxin Ma , Ping Gong , Jun Yi , Zhewei Yao , Minjie Wang , Cheng Li , Yuxiong He , Feng Yan

分类：机器学习

2022-07-29

图形神经网络（GNN）是具有无核数据的应用的有前途的方法。但是，具有数亿节点的大规模图上的培训GNN既是资源又是耗时的。与DNN不同，GNN通常具有更大的内存足迹，因此GPU内存能力和PCIE带宽是GNN培训中的主要资源瓶颈。为了解决此问题，我们提出分叉：一种图形量化方法，通过显着减少内存足迹和PCIE带宽要求来加速GNN训练，以便GNN可以充分利用GPU计算功能。我们的关键见解是，与DNN不同，GNN不太容易发生量化引起的输入特征的信息丢失。我们确定图形特征量化中的主要准确性影响因素，从理论上证明，分叉训练会收敛到网络，在该网络中，损失在未压缩网络的最佳损失的$ \ epsilon $之内。我们使用几种流行的GNN模型和数据集对分叉进行了广泛的评估，包括最大的公共图数据集MAG240M上的图形。结果表明，分叉达到30以上的压缩率，并在边际准确性损失的情况下提高了GNN训练速度200％-320％。特别是，分叉在一小时内仅使用四个GPU在MAG240M上的训练图来实现记录。

translated by 谷歌翻译

Generative Domain Adaptation for Face Anti-Spoofing

Qianyu Zhou , Ke-Yue Zhang , Taiping Yao , Ran Yi , Kekai Sheng , Shouhong Ding , Lizhuang Ma

分类：计算机视觉

2022-07-20

基于无监督的域适应性（UDA），由于目标情景的表现有希望的表现，面部抗散热器（FAS）方法引起了人们的注意。大多数现有的UDA FAS方法通常通过对齐语义高级功能的分布来拟合受过训练的模型。但是，对未标记的目标域的监督不足，低水平特征对齐降低了现有方法的性能。为了解决这些问题，我们提出了UDA FAS的新颖观点，该视角将目标数据直接适合于模型，即，通过图像翻译将目标数据风格化为源域样式，并进一步将风格化的数据提供给训练有素的数据分类的源模型。提出的生成域适应（GDA）框架结合了两个精心设计的一致性约束：1）域间神经统计量的一致性指导发生器缩小域间间隙。 2）双层语义一致性确保了风格化图像的语义质量。此外，我们提出了域内频谱混合物，以进一步扩大目标数据分布，以确保概括并减少域内间隙。广泛的实验和可视化证明了我们方法对最新方法的有效性。

translated by 谷歌翻译

Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing

Qianyu Zhou , Ke-Yue Zhang , Taiping Yao , Ran Yi , Shouhong Ding , Lizhuang Ma

分类：计算机视觉

2022-07-20

随着各种面部表现攻击不断出现，基于域概括（DG）的面部抗散热（FAS）方法引起了人们的注意。现有的基于DG的FAS方法始终捕获用于概括各种看不见域的域不变功能。但是，他们忽略了单个源域的歧视性特征和不同域的不同域特异性信息，并且训练有素的模型不足以适应各种看不见的域。为了解决这个问题，我们提出了专家学习（AMEL）框架的自适应混合物，该框架利用了特定于域的信息以适应性地在可见的源域和看不见的目标域之间建立链接，以进一步改善概括。具体而言，特定领域的专家（DSE）旨在研究歧视性和独特的域特异性特征，以作为对共同域不变特征的补充。此外，提出了动态专家聚合（DEA），以根据与看不见的目标域相关的域相关的每个源专家的互补信息来自适应地汇总信息。并结合元学习，这些模块合作，可适应各种看不见的目标域的有意义的特定于域特异性信息。广泛的实验和可视化证明了我们对最先进竞争者的方法的有效性。

translated by 谷歌翻译